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A new concept of within-individual epidemiology termed "myEpi" is introduced. It is 
argued that traditional epidemiological methods, which are usually applied to populations 
of humans, can be applicable to a single individual and thus used for self-monitoring and 
forecasting of "epidemic" outbreaks within an individual. Traditional epidemiology requires 
that results be generalizable to a predefined population. The key component of myEpi is 
that a single individual may be viewed as an entire population of events and thus, the analy- 
sis should be generalizable to this population. Applications of myEpi are aimed for, but not 
limited to, the analysis of data collected by individuals with the help of wearable sensors 
and digital diaries. These data can include physiological measures and records of healthy 
and risky behaviors (e.g., exercise, sleep, smoking, food consumption, alcohol, and drug 
use). Although many examples of within-individual epidemiology exist, there is a pressing 
need for systematic guidance to the analysis and interpretation of intensive individual-level 
data. myEpi serves this need by adapting statistical methods (e.g., regressions, hierarchical 
models, survival analysis, agent-based models) to individual-level data. 

Keywords: mobile health, epidemiology, statistics, data science, myEpi, wearable devices, evidence-based practice, 
self-care 



EPIDEMIOLOGY AND myEpi 

Epidemiology, literally "the study of what is upon the people," is 
defined as the science of the incidence, distribution, and control 
of disease conditions in a population (1). Epidemiology provides 
public health and policy makers with evidence such as risk fac- 
tors for disease and targets for preventive medicine. Epidemiology 
does not focus on a specific individual; treating disease conditions 
of an individual is traditionally left to the clinical field. At the 
same time, more individuals are showing active interest in their 
own health; and more physicians are interested in incorporating 
the wealth of existing information into their practices. When the 
focus is on the health and behavior of a specific individual, use 
of epidemiological methods can lead to confusion. On one hand, 
evidence-based principles state that practical decisions should be 
based on research studies. In fact, knowing that smoking leads to 
a plethora of health conditions including cancer is a good motiva- 
tion for an individual to quit smoking, and for a never-smoker not 
to initiate. On the other hand, overreliance on population-level 
evidence can lead to a well-known ecological fallacy when health 
risks (or lack thereof) are erroneously projected onto an individ- 
ual. For example, lawsuits have been initiated when a physician 
relies on evidence and assumes low risk and a patient develops a 
health condition (2). 

In this article, I introduce the concept of individual epidemi- 
ology (myEpi), which aims to provide individual-level inference. 
By definition, myEpi considers an individual as a population of 
person-level events and behaviors (Figure 1) and thus inferences 
about an individual can be obtained from within-person data. 

MyEpi does not ignore evidence collected from others in the 
population, in fact it embraces these data in the same way epidemi- 
ological studies embrace environmental factors. In this sense, the 



polluted environment of a faulty chemical plant could contribute 
to the increased risk of disease in a neighboring town, similar to 
the way disease status of close relatives (genetics) or social net- 
works (behavioral influence) contributes to an increase in disease 
risk for an individual. 

AVAILABLE DATA 

Over the last decade, a revolution in data collection within an 
individual (e.g., electronic diaries, ecological momentary assess- 
ments, and multiple physiological sensors) has allowed for the 
collection of large amounts of "intensive" data (3). Although tra- 
ditional methods such as Timeline Followback (4) are still being 
used for research purposes, web, and mobile applications allow 
one to record and analyze data in real time. Although the design, 
quality, and ownership of the data remain subjects of evolving 
research and policies, one thing is evident: data collected by an 
individual and for the use by that individual can have a strong 
impact on that person's health and behavior. 

Statistical requirements for myEpi data follow the same statisti- 
cal power rules as a traditional epidemiological study; a longitudi- 
nal study with sufficient observations could potentially be used for 
helping a specific patient. In decision sciences, the requirements 
for the amount of data can be relaxed in exceptional cases when 
individual data carry critical information. For example, in a hypo- 
thetical situation three antibiotics (A, B, and C) are available to 
treat a patient. If a patient developed severe allergic reaction in the 
past to drugs A and B on two separate occasions, then the only 
available option is C. This decision is likely to be made regardless 
of whether population-level clinical trials show that drug B is most 
efficacious. In this example, the presence of only two observations 
is sufficient to rule out two of three antibiotics. 
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FIGURE 1 | An illustration of inferring to an individual form the 
population-level and individual-level data. The green "myEpi" oval 
denotes within-person data which could be also combined with information 
about blood relatives, social networks, as well as the entire population and 
the environment. The purpose of the analysis, however, is the 
individual-level inference. 



This example also illustrates that data for medical decision pur- 
poses should be of high quality. In reality, without the availability 
of electronic health records, healthcare providers often rely on 
patients' recollections of past events, often with a long time lag 
and thus reduced reliability. Additionally, medical records often 
do not include events that happen outside health care. In addition 
to and sometimes instead of formal healthcare individuals some- 
times rely on personal opinion and experience and the advice of 
family, influential social network members, advertisement, etc. (3, 
6). Recordings of self-treatment and outcomes become more avail- 
able and should be used to provide insights into individuals' health 
and behavior. 

AVAILABLE ANALYSIS METHODS 

These within-person data are different from data traditionally 
collected in epidemiological studies, but myEpi can adapt most 
methodology from traditional epidemiological methods. Below 
are a few examples. When collecting multiple observations per 
subject, epidemiologists have studied between- and within-subject 
variation for decades and used regression-based tools like Hierar- 
chical Linear Models. In epidemiological studies within-subject 
variation represents the average over many individuals rather than 
the variation within a single individual. In myEpi, the focus is 
on variability within an individual (3, 7-9). Bayesian methods 
open an opportunity to combine evidence from population- 
level research with observed data from a specific individual, 
thus borrowing strength from the group where actual data are 
lacking (10). 

Infectious diseases such as measles and influenza have been 
extensively studied using epidemic surveillance and mathematical 
models (11). These models are based on daily or regular incidence 




Date 



FIGURE 2 | A "moving windows" method to identify patterns of 
alcohol use trajectories. Distributional properties of sliding windows W, 
and 1/V, +S _, are compared to each other. The point when the distributions 
become significantly different signifies the change in patterns. We illustrate 
the point at which the pattern switched from type 5 to 7 as the number of 
drinks increases. The figure is reproduced with permission from Ref. (8). 



reports and if model parameters are well calibrated the model can 
predict the course of an epidemic (11-13) and evaluate strategies 
to contain it (12, 13). The same methods, such as distinguish- 
ing an emerging epidemic from occasional outbreaks, can be used 
when considering individual data. The application again requires 
a change of mindset. Similar to considering regular (e.g., weekly or 
monthly) disease incidence reports one can consider regular (e.g., 
daily or weekly) reports of specific events. These events could be 
categorical (e.g., exercised or not), count (e.g., drank five beers), or 
continuous (e.g., consumed 3680 calories). In substance use stud- 
ies, a number of tools are available to track alcohol and tobacco 
consumption; drawing an analogy with an infectious epidemic, 
the tools can detect the start of increased use (8) (Figure 2), pre- 
dict most likely moment for relapse (14), predict future use (15), 
and identify strategies to influence the recovery process (16, 17). 

Survival analysis has been broadly used to estimate exposure 
risks in the population and survival curves can show, for exam- 
ple, survival from HIV among men who have sex with men in a 
long-term cohort study ( 1 8 ) . Although the study emphasized that 
condoms and antiretroviral treatment (ART) work as preventive 
factors, it tells little about individuals' chances to seroconvert. At 
the same time, knowledge of individual sexual and drug-using 
behavior in discordant couples allows one to estimate the timing 
of HIV seroconversion (7). Changes in individual behavior (e.g., 
compliance with ART) alter the HIV-negative survival of a partner 
(Figure 3). 

For complex problems involving health systems and adaptive 
behavior computational methods become very popular (9, 19). 
Agent-based models (ABMs) have recently become powerful tools 
to estimate the effects of interventions on community and pop- 
ulation health (20, 21). At the same time, ABMs allow one to 
track an individual and examine what could potentially happen 
to this specific individual over the course of time under different 
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Number of month with 15 unprotected acts 



FIGURE 3 | Survival curves for staying HIV negative for a female with a male HIV-positive partner in a latent HIV stage, assuming two unprotected 
vaginal intercourses per week. Model details and parameters are described in detail in Ref. (7). 



scenarios (22). Modeling an individual in the context of social 
network and a series of individual events is in its infancy, espe- 
cially when considering health outcomes. However, this area has 
been progressively developed in the context of military actions and 
response to terrorism (23). 

INTERPRETATION OF DATA SUMMARIES AND BEHAVIOR 
CHANGE 

Besides professionally designed and analyzed studies, individuals 
collect large amounts of data for their personal interpretation and 
decision making. Machine learning tools that are being developed 
as part of the "Big Data" research agenda can provide people with 
real-time interpretations of their data. Incorporation of these tools 
in the backbone of mobile apps is likely to play a crucial role in 
the future. At the same time, understanding probability, associa- 
tions, causality, and uncertainty to the users of thousands of web 
and mobile applications becomes critical because these interpre- 
tations affect health and behavior decisions. Just as a public health 
policymaker needs to understand and interpret epidemiology, an 
individual needs to understand and interpret individual data for 
personal decisions. 

An individual can also compare him- or her-self to other indi- 
viduals, trends, and percentiles in the available sample. In this 
sense, such analysis is similar to how cross-cultural and cross- 
country comparisons are made in epidemiology. However, myEpi 
emphasizes considering these trends and percentiles only as a 
context for an individual trend. For example, if an individual is 
monitoring his or her blood pressure he or she might also like to 
compare it with national statistics, daily and annual trends, etc. 
However, the most important comparison is with norms for self. 
Some people can naturally have lower blood pressure and a sud- 
den jump in pressure can be worrisome even if it remains below 
the national norm. Analogously, an infant can develop perfectly 
well according to her own growth curve, which might be in a low 
or high national percentile. 



CONTROL OF PERSONAL HEALTH 

Control of national epidemics has been a prerogative of the gov- 
ernment's public health system. Control of individual health, 
however, has been more ambiguous, with a large role placed on 
physicians and health care providers in general but mostly for the 
treatment of disease. Prevention has traditionally been a part of 
individuals' everyday life. At the same time, public health officials 
and physicians have been trained for years to recognize patterns 
within populations, groups of patients, and a single individual, 
while the majority of individuals are not trained to monitor them- 
selves in a systematic way. This lack of statistical training often 
results in superstitions and ungrounded beliefs. Currently, the 
availability of data collecting and monitoring tools could poten- 
tially lead to more ungrounded beliefs and erroneous conclusions 
if rigorous and reliable help is not provided to individuals for 
analysis of their own data. 

LINK BETWEEN EPIDEMIOLOGY AND myEpi 

One of the disputed issues in the use of individual data is their 
generalizability to a larger population. Under the myEpi con- 
cept, models and methods should not necessarily be generaliz- 
able to the population (it would be an additional merit if they 
were) but should show validity within an individual. This distinc- 
tion becomes increasingly important in the process of funding 
and publication reviews and more discussion of good analysis 
practices are needed. Although myEpi is focused on the indi- 
vidual, it has important potential in the improvement of pub- 
lic health by acting on one person at a time. The application 
areas could be different (e.g., monitoring sleep, blood pressure, 
food and exercise, tracking cigarettes smoked) and the results 
will vary drastically by the application, tool, and individual. 
However, my hypothesis is that those individuals who use myEpi 
(i.e., monitor their health using intelligent tools and rigorous 
methods) would improve their well-being more than those who 
do not. 



www.frontiersin.org 



August 2014 | Volume 2 | Article 97 | 3 



Bobashev 



myEpi. Epidemiology of one 



REFERENCES 

1. Webster Dictionary. Available from: http://www.rnerriam-webster.com/ 
dictionary/epidemiology 2014 

2. Hurwitz B. How does evidence based guidance influence determinations of 
medical negligence? BMJ (2004) 329:1024-8. doi:10.1136/bmj.329.7473.1024 

3. Walls TA, Schafer TL. Models for Intensive Longitudinal Data. New York, NY: 
Oxford University Press (2006). 

4. Sobell L, Sobell M. Timeline follow-back: a technique for assessing self-reported 
alcohol consumption. In: Litten R, Allen J, editors. Measuring Alcohol Consump- 
tion. Clifton, NJ: Humana Press (1992). p. 41-72. 

5. Robert Wood Johnson Foundation. Health Care's Blind Side: The Overlooked 
Connection Between Social Needs and Good Health. Princeton, NJ: Robert Wood 
Johnson Foundation (2011). 

6. Smedberg A. E-Health Communities and Online Self-Help Groups: Applications 
and Usage. Sweden: Stockholm University (2011). 

7. Bobashev G, Norton J, Wechsberg WM, Toussova O. Are you HIV invincible? 
A probabilistic study of discordant couples in the context of HIV transmission. 
PLoS One (2014) 9(5):e94799. doi:10.1371/journal.pone.0094799 

8. Bobashev G, Liao D, Hampton J, Helzer J. Individual patterns of alcohol use. 
Addict Behav (2014) 39:934-40. doi:10.1016/j.addbeh.2013.12.004 

9. Ridenour TA, Pineo TZ, Maldonado- Molina MM, Hassmiller Lich K. Toward 
rigorous idiographic research in prevention science: comparison between three 
analytic strategies for testing preventive intervention in very small samples. Prev 
Sci (2013) 14(3):267-78. doi:10.1007/slll21-012-0311-4 

10. Winkler RL. Introduction to Bayesian Inference and Decision. Gainesville, FL: 
Probabilistic Publishing (2003). 

11. Available from: http://portaldev.rti.org/10_Midas_Docs/infoMaterials/ 
MIDAS_Overview.pdf 

12. Ferguson NM, Cummings DAT, Fraser C, Cajka CC, Cooley PC, Burke DS. 
Strategies for mitigating an influenza pandemic. Nature (2006) 442:448-52. 
doi:10.1038/nature04795 

13. Epstein JM, Goedecke M, Yu F, Morris RJ, Wagener DK, Bobashev GV. Control- 
ling pandemic flu: the value of international air travel restrictions. PLoS One 
(2007) 2:401. doi:10.1371/journal.pone.0000401 

14. Witkiewitz K, Marlatt A. Modeling the complexity of post-treatment drinking: 
it's a rocky road to relapse. Clin Psychol Rev (2007) 27:724-38. doi: 10.1016/j. 
cpr.2007.0 1.002 

15. Warren K, Hawkins RC. Multiscale nonlinearity in a time series of weekly alcohol 
intake. Psychol Rep (2002) 90:957-67. doi:10.2466/pr0.2002.90.3.957 

16. Quitnet. Available from: http://www.quitnet.com/ 



17. Gustafson DH, McTavish FM, Chih MY, Atwood AK, Johnson RA, Boyle 
MG, et al. A smartphone application to support recovery from alcoholism: 
a randomized clinical trial. JAMA Psychiatry (2014) 71:566-72. doi:10.1001/ 
jamapsychiatry.20 1 3.4642 

18. Plankey MW, Ostrow DG, Stall R, Cox C, Li X, Peck JA. The relationship 
between methamphetamine and popper use and risk of HIV seroconversion in 
the multicenter AIDS cohort study. J Acquir Immune Defic Syndr (2007) 45(1): 
85-92. doi:10.1097/QAI.0b013e3180417c99 

19. Sterman JD. Learning from evidence in a complex world. Am J Public Health 
(2006) 96(3):505-14. doi:10.2105/AJPH.2005.066043 

20. El-Sayed AM, Scarborough P, Seemann L, Galea S. Social network analysis and 
agent-based modeling in social epidemiology. Epidemiol Perspect Innov (2012) 
9:1. doi:10.1186/1742- 5573-9-1 

2 1 . Hoffer L, Bobashev GV. Researching a local heroin market as a complex adaptive 
system. Am I Community Psychol (2009) 44:273-86. doi:10.1007/sl0464-009- 
9268-2 

22. Hoffer L, Bobashev GV, Morris RJ. Simulating patterns of heroin addiction 
within the social context of a local heroin market. In: Gutkin B, Ahmed S, edi- 
tors. The Computational Neuroscience of Drug Addiction. New York: Springer 
Verlag (2011). p. 313-31. 

23. Zacharias G, MacMillan J, Van Hemmel SB, editors. Behavioral Modeling and 
Simulation: From Individuals to Societies. Washington DC: National Academies 
Press (2008). 

Conflict of Interest Statement: The author declares that the research was conducted 
in the absence of any commercial or financial relationships that could be construed 
as a potential conflict of interest. 

Received: 13 June 2014; paper pending published: 20 June 2014; accepted: 11 July2014; 
published online: 12 August 2014. 

Citation: Bobashev G (2014) myEpi. Epidemiology of one. Front. Public Health 2:97. 
doi: 10.3389/fpubh.2014.00097 

This article was submitted to Epidemiology, a section of the journal Frontiers in Public 
Health. 

Copyright © 2014 Bobashev. This is an open-access article distributed under the terms 
of the Creative Commons Attribution License (CC BY). The use, distribution or repro- 
duction in other forums is permitted, provided the original author(s) or licensor are 
credited and that the original publication in this journal is cited, in accordance with 
accepted academic practice. No use, distribution or reproduction is permitted which 
does not comply with these terms. 



Frontiers in Public Health | Epidemiology 



August 2014 | Volume 2 | Article 97 | 4 



